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A microanalytical procedure for determination of the base composition of DNA 

Marc COEN'E and Carlo COCITO 

Microbiology and Genres Unit. Institute of Cell Pathology. University of Louraia Medical School. Brussels 
(Received February 21 April 22. 1935) - EJB 35 0176 

v,i.«T ? rocedure . ,br determination of the percentage guanine plus cytosine (% C + C: mol/lOOmol) 
values of mtcroquantmes of DNA is described. Its principle is a DNA-polvmerase-l-di ected nick ,Tan ation o 
DNA in «he presence rof dGTP. dTTP. [ J H]dCTP.and [,.»PJdATP. Kinetics experiments indict phwu 
aiue reached m about 20 nun of incubation under our experimental conditions. Percentage G + C f obttSd 
from the hnearrelauon l/(% G + C) = 0.01 *("P]:['H] + O.Ol.wheretheratioof trichloroacetic -ad tp editable 

pairs). U SU al methods for % G + C analysis (buoyant density and melting temperature de rmin 2 ) yi d 
unrehable results ,n the presence ofeuher modtT.ed or unusual bases: the double-labcline procedure stil Sid 
under these condtt.ons The latter is. therefore, the method of choice for an.lv* of rare DNA specie Sac 
available ,n very small quantum (it requires amounts of DNA as low as 1 ne. i.e. several order of maenitude 
lower than those used for chromatographic analysis of DNA hydrolysates). Since the obtention of hiehly purified 

d ° Ub,C - |abe "'^ ■»»«"«■ ■ ^ purification of bacte^l DNA 



Base composition is an essentia! parameter for char- 
acterization of DNA molecules. Owing to the homologous 
base pairing A-Tand G-C. theguanine-plus-cvtosine content, 
mol 100 moi DNA (% CrC value) is a distinctive trait for 
double-stranded DNA. The taxonomic significance of G + C 
values is universally accepted, and determination of base 
content is widely used in bacterial classification [1]. Tolerance 
ranges of 10% G + C for a bacterial genus, and of 3% G-fC 
lor a species are widely accepted. 

Base composition ran be determined by quantitative 
analysis of components (nucleotides, nucleosides, bases) 
obtained by fractionation of DNA hydrolysates: this proce- 
dure is wearisome and requires large amounts of material. 
Conversely % G + C values can be obtained either from the 
buoyant density, which is related to the former bv a linear 
relation o = 1 .660 + 0.098 (G + C) [2, 3J. or from the thermal 
denaturation midpoint (/ m ). to which the linear relation t m = 
69.3 + 0.41 (G + C) applies [4-7]. In turn, buoyant density 
can be obtained from the refractive index of the CsCl solution 
according to the g 25 c = I0.860///6 5 c — 13.4974 [8]. 

Buoyant density and / m determinations are easier and 
-more sensitive procedures, as compared to analysis of DNA 
hydrolysates. Their main limitation is that the occurrence of 
either modified or unusual bases (modified bases are the result 
of postpolymerization processes, whereas unusual bases are 
synthesized by special metabolic pathways) alters the linear 

Correspondence to C. Cocico. UCL 7449. Avenue Hippocraie 75 
B- 1 200 Brussels. Belgium 

Ahhreviatiuns. % G-i-C. guanine plus cytosine content of DNA 
f mo! 100 mol): midpoint of optical absorption melting curve. 

Enzymes. DNA poll, deo.xynucleotidetriphosphate: D~NA deoxv- 
niicleotidWiransferase (EC 2.7.7.7): RNase. ribonucleuse A (EC 
-■-.'.:..:»: l>M.i*>nur. mucopcptide .V-acctvlmuramovlhvdrolase (EC 
3. 11.17). 



relations linking the three colligative properties of double- 
stranded DNA. Thus, for example, virus 2C DNA has appar- 
ent G+C values of about 80% accordins to its buoyant den- 
sity value in CsCl. and of 15% from the established t m value, 
whereas base analysis of DNA hydrolysates yields 38% G + C 
(9 - 12]: such a discrepancy has "been related to a replacement 
of thymine with hydroxymethyluracil. Similar disagreement 
among colligative properties has been reported for the DNA 
containing other unusual bases [13]. Recently a spectro- 
tluorimetric method for bacterial DNA base composition has 
been described [14], Unfortunately, till now nothing is known 
about the applicability of this technique to superceded DNA 
and to DNA with unusual bases. 

The present work relates a procedure for analysis of G + C 
content of double-stranded DNA. which is based on double 
labelling with one precursor in A-T pair and one precursor 
in G-C pair. The range of application of this method has been 
established by analyzing DNA of known composition and of 
widely different G + C values. More specifically it will be 
shown that this sort of technique applies equally well to DNA 
carrying unusual and modified bases, where biophysical 
methods yield unreliable results. A major advantage of this 
procedure is the ability to analyze nanoeramquantities of 
DNA : this allows a study of rare DNA species, such as certain - 
plasmids. and the chromosomes of viruses, cell organelles and 
fastidious organisms. 



MATERIALS AND METHODS 

Microorganisms and viruses 

The following microorganisms were used: Bacillus subti/is 
strains 1 6S 2 (American Type Culture Collection ATCC 



476 



27639) and SB202 (J. Monod Institute. Paris. France). 
Clostridium perfringens (ATCC 3624). Escherichia coli strain 
M9{meC RNAase") (Max-PIanck-Institute. Berlin. FRG). 
Micrococcus lysodeikticus (syn. luteus) (ATCC 4698), Afyco- 
bacterium bovis BCG (Pasteur Institute. Paris), and Pseudo- 
monas aeruginosa (ATCC 10145). Growth was carried out 
under standard conditions (ATCC cataloaue of strains I XV 
edition. 1982. Rockville, MD, USA). 

Phage T2 was multiplied in £. constrain B (ATCC 1 1303). 
and phages 2C and SP01 in B. subtilis strain 168/2. 

Preparation of DNA 

For extraction of bacterial DNA, 10 ma (dry weight) of 
ceils suspended in 500 nl 150 mM NaCl.~10mM EDTA. 
50 mM Tris/HCI pH 7.8 were incubated 14 h at 50 ; C with 
0.5 mg lysozyme. 1 h at 37 =C with 0.5 mg pronase and 1 h at 
37 : C with 1% sodium dodecyl sulfated Mixtures were re- 
peatedly extracted with chloroform/isoamyl alcohol = 24:1 
(vv) mixture fa procedure similar to that described by 
Marmur) [15]. water-saturated phenol and ether. After incuba"- 
tion with RKase A from bovine pancreas (25 U2/500 ul I h 
37 C), DNA was fractionated by exclusion chromatography 
on a Sephadex G-50 column previously equilibrated with 
4.S mM sodium phosphate buffer pH 7.8.' Fractions eluted in 
the void volume were purified on hydroxyapatite columns 
which were successively washed with 0. 1 M sodium phosphate 
buffer pH 7.8 containing 1% (w/v) sodium dodecyl sulfate 
and 8 M urea, and with 4.8 mM sodium phosphate bufTer 
pH 7.8. and eluted with 480 mM sodium phosphate buffer 
pH 7.S. Final purification was achieved on Sephadex G-50 
columns previously equilibrated with 0.1 mM EDTA. 10 mM 
Tris-HCI buffer pH 7.8. This procedure proved suitable for 
extraction and purification of certain kinds of DNA (such as 
those from mycobacteria), which can hardly be obtained in a 
pure form with other procedures. 

DNA from phages T2 and 2C was obtained by water- 
saturated phenol and ether extraction, as previously described 
[16. 17], 

Some experiments were performed with purified DNA of 
commercial sources: thus, the DNA of Clostridium per- 
frinzens. virus T4 and calf thymus were obtained from Siama 
(Sc Louis. ML USA), methylated-adenine-lree DNA of phace 
dam' (from £. coli /. strain CI 857) was purchased from 
Genofit (Geneva. Switzerland), and pBR322 DNA was from 
Boehringer (Mannheim. FRG). 

Polymerase-directed synthesis of labeled DNA 

Reaction mixtures for the synthesis of double-labeled 
DNA -had the following composition: 7.5 mM MsCi>. 
0.75 mM dithiothreitol. 30 mM Tris/HCI buffer pFf 7.4~: 
5.25 uM dGTP. dTTP and dATP (Boehrineer, Mannheim 
FRG): 5.25 uM [5- 3 H]dCTP*(28.5 Ci/mmoL New England 
Nuclear, Dreieich. FRG). 0.004 uM [a- 3 -P]dATP (3200 Ci. 
mmol. New England Nuclear), and 50 units/ml DNA poly- 
merase I (type 104485 of Boehringer, an endonuclease-con- 
taining preparation). Note that the concentrations of the four 
deoxyribonucleoside triphosphates were similar as were the 
radioactivity levels of the two labeled precursors. 

Polymerization was started by the addition of DNA (5 ul 
of a solution containing 100 ugDNA/ml added to 10 ul of 
the reaction mixture), and pursued for 25 min at 20 "C. Reac- 
tion was stopped by dilution with 10 mM Tris/HCI pH 7.4 
10 mM EDTA. and addition of ice-cold 10% trichloroacetic 



acid containing 1% bovine serum albumin. Samples were 
ttitered through glass-fiber niters (MN85, Macherey-Na°el 
Duren. FRG). which were subsequently washed with 5% 
trichloroacetic acid. Filters were air dried and counted for 
ooth H and -P in a scintillation spectrometer: from these 
values the background values (samples without DNA which 
were treated in similar way) were subtracted. Correction was 
made for 32 P" contamination of the 3 H counting channel. 

Calculation of the G -r C value 

In double-stranded DNA the G + C content is related ti- 
the ratio of adenine to cvtosine: 



% G + C 



[Gl + IQ 



(A] + [T] + [G) t [C] 
100 



x 100 



100 



[enter [C] +1 

The adenine-to-cytosine ratio was obtained from the 
trichloroacetic-acid-precipitable radioactivity accordine to 
the equation: 



G + C = 



100 



/ JP1 + 1 



The K value was determined for each experiment by usina 
several reference DNAs of known composition. It is impor- 
tant to note that this radioactivity ratio indicated the 
percentages of four nucleotides (moI/100 mol). 

By plotting l/(% G + C) against the J2 P/ 3 H ratio, a linear 
relationship was obtained, which allowed the use of linear 
regression. Accordingly, the relation I/(%G~C) = 0.01 
A'[ J2 P].[ J H] -i- 0.01 was used, whereby 0.01 K was the slope, 
and 0.01 the ordinate intercept value. 



RESULTS 

Determination of the G + C content of different DNA species 

The application range of the double-labelling procedure 
for percentage G -f- C determination was assessed by sub- 
mitting to this analytical procedure DNA species widely 
differing in their composition and structure. The following 
parameters were considered: (a) source (eucaryotes, pro"- 
caryotes. plasmids and viruses): (b) conformation (linear, 
circular and supertwisied molecules); (c) bases (usual, un- . 
usual, modified and unmodified nucleotides). Moreover, this 
study was carried out with 12 reference DNA species, which 
were previously analyzed in several laboratories by use of 
both chemical and biophysical techniques [18, 19]. Samples 
were chosen as to cover the entire ranae of G + C values from 
25% to 75%. 

Data reported in Table 1 indicate a good correspondence * 
(deviation lesser than 1.9%) between the average values found 
in the literature and those established by the double-labelling 
procedure. Moreover, with exclusion of Pseudomonas DNA 
the range of variation for five determinations, repeated with 
the same species of procaryotic and viral DNA. was less than 
1.5% G-f C Other conclusions stemming from data in Table 1 
include the observation that this procedure applies equally * 
well to DNA species encompassing the whole ranee of bac- 1 
terial DNA G-i-C values (26-72% G + C) and of different 



Tibli I . Guanine-plus-cytosine values of different DNAs 

>> G -r C values are averages of five exoeriments. 'Differ-:** is th<M 

between literature and experimental data 



Source of DNA 
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Fig. 2. Kinetics of incorporation of deoxynucleoside triphosphates into 
D.\A m double-labelling experiments. E.coli AI9 DNA was incubated 
with DNA polymerase I.dGTP. dTTP. [5- J H]dCTP. and [i-«P]d ATP 
<ct. Materials and Methods). At different times, samples were 
withdrawn, and the '-R"H ratio in trichloroacetic-add-precipitatcs 
was determined. Symbols: J -P (O). J H (A), and % G-C(*) 



J ! 

3 
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Fig. I. Relationship between KG+C value and >-p >H ratio from 
ununlcdabcthnz experiments. The folio winu sources of D\*A were 
considered: M. lysodeikticus (I). P. aeruginosa (2) \f bovi, t\\ 
phinud P BR322 of E. co/i (4). E. «>ti (5,. pha,e ;. of Eeoii (6) V 
ynnhs (7). call thymus (N). phage 2C of A .v,/A/,Y,> <V). phj. tf of 
( ,0K P h:i ^ c T4 of E. voii (II) and C pcrfrinzens . I C j D\.\ 
solutions were nieubated with DNA polymerase I in a reaction 
■mixture containing the four dcoxyribonuclcotides with [^HldCTP 
tu J '^ dATP " lab0lied P rccursors «cr. Materials and Methods) 
I he -P. H tnchloroacciic-aad-prccipiiableradioaciiviiv values were 
plotted against the % G 4- C literature values. Deviation in five ex- 
periments is reported. The corresponding % G + C values were 
obtained Irom the relationship G + C ) = 0 0*>044 f- % -pi Mm 4- 
0.00967 1 



conformation (circular forms of plasmids as well as linear 
molecules of cellular and viral DNA). Even more imoortant 
is the demonstration that the double-labellins method is a 
suitable tool for analysis of DNA carrying modified bases 
(bacterial DNA containing methvlcytosine and methvl- 
adenme). unusual bases (phage 2C containine hvdroxvmech- 
yluracii in place of thymine) and unusual modified bases 
(T-even phages, in which cytosine is replaced bv 
hydroxymcthylcytosine carrying one or two molecules of 
glucose). 

Linearity of the relationship between the double-labelline 
determination and the reciprocal value of : „ C - G arrears 
evident from the graph in Fig. I. The latter was obtained bv 



plotting the [ J -P] [ J H] values of radioactivity determination 
against the !/(% C-rC) values in the literature. In this graph 
the variation range for live determinations is displayed. 

Kinetics of DNA double-labelling 

Although data in the previous section indicate an 
unrestricted applicability of the double-labelline technique to 
analysis oi double-stranded DNA irrespective^ its origin, 
conformation and base composition, it is still possible that 
polymerase I displays a preferential selection of certain 
domains within a given DNA molecule. If this were the case 
a carelul timing of the length of labelling would be an essential 
prerequisite for the analytical procedure, since variable results 
are expected in the course of the polymerisation reaction. This 
problem was approached by submitting E. colt DNA to the 
double-labelling procedure. Samples of the reaction mixture 
were withdrawn at different intervals, and the % G + C values 
were determined. 

Data reported in Fig. 2 indicate that incorporation of 
both -P and H in the course of the polymerization reaction 
increases lor about 20 min and remains at a plateau level for 
about I h. Consequently a standard incubation time of 25 min 
was chosen for the experiments reported in the present work. 
The corresponding % G + C values were reasonably close to 
a straight horizontal line. These results support the inference 
that faithful %GrC values are expected from the double- 
labelling procedure, irrespective of the length of the reaction. 

Minimum size of DNA 

In view of the expected heterogeneity of the eenome. the - 
occurrence of a critical size would reasonably bepostulated. 
This problem was tackled by submitting DNA to periods of 
sonication of different duration. The size of the fragments 
was evaluated by electrophoresis in the presence of molecular 
mass standards. Unsonicated (150000 base pairs) and' 
sonicated (down to 100 base pairs) 2C DNA was submitted 
to double-labelling procedure in the presence of two reference 
DNAsjC. pcrjringens and .1/. lysodeikticus). As shown in 
:ablel DNA segments iarger than J00 base pairs, yielded 
G -r C values close to those of the whole genome. 



Table 2. .Analysis ofDSA of different sizes 
3 G + c va! ' J « are averages ol'chree experiments 

Source of DMA Size 

(base pairs! 



»G + C 



data_ purification on hydroxyapatite columns in the presence 
ot detergents ana 8 M urea is a „ essential step lb 
mycobacterial ane corynebacterial DNA purification ' 









moi. iuu moi 




150000 




37.0 ± 0.2 




405- 


315 


36.7 + 0.6 




405 - 


2S5 


36.9 + 2.2 




240- 


90 


37.2 ± 2.3 




< 120 




34.1 ± 1.9 


C. perfrinzens 


150000- 


1500 


26.5 ± 1.2 


\l. lysotkikticus 


100000- 


15000 


7I.S ± 12 



lable 3. Sensitivity of the procedure for %G + C determination 
. . O + C values are averages of five experiments. Labellins efficiency 
■> the ratio (/„) of the trichloroacetic-acid-precipitable radioaciivitv 
to total radioactivity (background value = 0.5% for 3: P and 0.6% 



Source of DN'A Quantity % CrC 



Efficiency 
of labelling 





ng 


moi: 100 moi 


. » 




E. colt 


150 


49.9 + 2.3 


3S.4 


54.8 




50 


49.7 ± 1.5 


21.7 


30.6 




15 


50.2 ± 1.2 


15 


113 






4S.7 + 0.6 


; "> 


4.4 




1.5 


50.0 ± 2.4 


1.4 


2.0 


C. pcr/ringats 


500 


26.6 ± 1.7 


10.2 


5.4 


M. lystulcik tints 


500 


71.9 ± 1.4 


5.1 


18.1 



Sensitivity of the procedure 

The sensitivity of the method was evaluated bv 

fAvT'?? lhe G * C Value 0, ' a uide ran S e ^"concentrations 
ol DNA. Five quantities of E. eoli DNA and a control without 
UNA were analyzed with two reference DXAs[C.perJringc„s 
and M lysodeikticus). From data in Table 3 it can be inferred 
tnat the double-labelling procedure allows the G - C value of 
nanogram quantities of DNA to be determined with precision. 

DISCUSSION 

In-Materials and Methods a procedure for extraction and 
purification of bacterial DNA is detailed. This procedure is 
particularly suitable for isolating the genome of microorgan- 
isms or the CMN (Corynehacieritmt. Mycobacterium and 
Socardta) group, which share an unusualiv toush cell wall 
resistant to both mechanical disruption and enzvmatic dioes- 
non Moreover, the difficulty of obtainins mycobacterial 
UNA tree of contaminating polysaccharides and proteins has 
been repeatedly related [20]. In fact, these contaminations 
interfere with both % C+C determination bv biophysical 
procedures and template function of DNA in duplication and 
transcription. This justifies the repeated cvcles of purification 
ot bacterial DNA by column chromatography. The impor- 
• ; :nce oi nyaroxyapatite chromatography for DN'A purifica- 
tion has been well substantiated (21. 22]. According to our 



The procedure for % G +• C determination: which is de- 

Scfion'^xrV'"' 5 ? 1 WOr! \ iS b T d ° n the Polymerization 
reaction. DNA - deoxynucleos.de i'-triphosohate - DN'A 
(deoxynucleotides /,+ !)- p.. P| , which is catalyzed bv DN'A 
polymerase I ol £. coli and produces a chain growth in"th- 
^ - j direction, whereby the last nucleotide is'added to the 
-hydroxyl end of the chain (/tail' type of growth). The 
template for this sort of reaction ought to be either nicked or 
gapped double-stranded DNA. since intact duplexes, either 
linear or circular, are unsuitable as template-primers for 
polymerase I. For this reason a preparation of DNA poly- 
merase I. containing a well-established quantity of endo- 

"" C ff; * as US£d lh£ present work ( cf - Materials and 
Methods). Conversely, the use of pure DNA polymerase I 
plus a known amount of pancreatic DNase I ou°ht to be 
considered. DNA polymerase I possesses also a 5 f -3' ex- 
onuclease act.on. supposedly directed toward removal of 
UNA segments damaged by radiation or altered by mutagens 
l-JJ: tne latter activity, in conjunction with a primer-depen- 
dent polymerization in the 5- -3- direction, is responsible 
lor the n.ck translation' [24]. on which the double-labellin- 
procedure described in the present work is based. 

The %GrC value was determined from the labelling 
ratio or the A-T pairs with a radioactive marker, and of the 

Sm^o W an0Cher - the P rescm work ("PldATP and 
[ HdCTP were used, the choice of a very strong and a verv 
weak beta emitter being the most favorable for spectromctri'c 
determinations. Although there is no a priori discrimination 
against the use ol "C and J1 S-labcIled nucleotide precursors 
contamination of the J H channel bv either J5 S or 1J C is 
expected to be almost twentyfold higher than that by J: P 
(30 /o contamination level, instead of 2.5%). Also it must be 
tressed that the precision of our method requires the use of 
comparaole amounts of radioactivity for the two kinds of 
precursor. Indeed, a too high S1 P radioactivity level produces 
an exceedingly heavy contamination of the J H channel and 
decreases the precision. of % C + G measurements 

The occurrence of modified bases in DNA of both pro- 
caryotes (2^-27) and eucaryotes [23] is well established. 
Mod.tied bases entail the obtcntion of abnormal buovant 
density and , m data, which lead to erroneous G + C values 
Thus, for example, the occurrence of I % methvlated cytosine - 
produces an increase 0 r, m ofO.IS C(thus raising the apparent 
o u -r L o| 0.j2). and a decrease in buoyant density of 
U.4S mg cm (corresponding to a % G + C 0.48 higher) 
(values calculated after [29]). Unlike these biophysical tech- 
niques, tne double-labelling method allows faithful % G + C 
determinations even in the presence of modified and unusual - 
bases (cl. Table 1). 

The percentage G H- C value can be calculated in several 
ways trorn the experimental 3 -p/-»H ratios. In the eraphic 
mode ., G + C is obtained by intrapolation using a graph 
like that in Fig. 1. where the radioactivities ratios are plotted 
against the !/(% G -=■ C) values. More precise is a computing 
approacn based on the relation 1/(%G + C) = a[ i2 P]i 
[;H] + />. This consists in calculating a regression line from 
lew reference DNAs of known base composition and covering 
the whole /, G + C range. In practice a couple of extreme 
standarcs. such as C. perfringeiu DNA (26.5% G ■>- C) and 
U. lysodeikticus DNA (71.9% G + C) are sufficient Tor 
calculating w.th precision the slope or the line and the value 
~' us intersection with the ordinate axis. Further simplifica- 



ot 



' don can be obtained by assuming a valJ. of 0.01 for the 
ordinate intercept, and measuring*" the % G -i- C value of a 
unique reference DNA: the lower the % G-rC value of refer- 
ence DNA. the higher the precision by which the curve slope 
is measured. 

A main objection to the procedure outlined in this paper 
■ is that polymerase I might nick-translate preferentially DNA 
segments of a given composition. Since, however, this proce- 
dure was proven to apply equally well to DNA of quite dif- 
ferent G 4- C content (Fig. i). it can be inferred that the pos- 
sible heterogeneity of the genome does not stand out. when 
analysis is carried out on sufficiently long DNA sesrments. 
This finding agrees with the report that DNA polymerase I 
hus no preferential duplication sites [24] (which is also appli- 
cable, our data show, to DNA with unusual bases). Analysis 
o( small DNA segments of different composition might' be 
able to disclose a microheterogeneity of DNA: however, data 
in Table 2 indicate that the limit of applicability of our proce- 
dure is about 120 base pairs. 

The double-labelling procedure for % G + C determina- 
tion presents the following advantages with respect to the 
available methods: (a) it is much shorter (buoyant densitv 
determinations by centrifugation in CsC! gradients require 
cays): (b) it is relatively inexpensive (compared to the costlv 
equipment required for / ra and buoyant density measure- 
ments): (c) it is not limited by the presence of modified and 
unusual bases nor by the occurrence of base-linked hexoses 
(as are / M and buoyant density determinations); and (d) it 
requires nanogram quantities of DNA (microgram amounts 
of DNA are required for chromatographic analysis of DNA 
hydrolysates). Since % GrC determination by biophvsical 
procedures is always uncertain without parallel chemical 
analysis (which requires large amounts of material), the 
double-labelling procedure outlined in the present paper is 
the method of choice when very small amounts of DNA are 
available. 
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